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(54) Abstract Title 

Tamper detection for video images 



(57) A method for testing (500, 600, 700) a presence of a 
watermark in a scalable video sequence, for detecting 
tampering of the scalable video sequence, includes 
receiving (502, 602) a scalable video sequence having a 
plurality of blocks, wherein a number of blocks include at 
least one watermark coefficient; performing a discrete 
cosine transform on said video sequence, thereby 
recovering a plurality of coefficients associated with one 
or more block(s); and comparing (506) a number of 
positive coefficients with a number of negative 
coefficients to determine whether said received scalable 
video sequence has been tampered with. 

A method for classifying a type of watermark attack 
on a scalable video sequence. This allows a determination 
to be made as to whether a scalable video sequence has 
been tampered with; and provides a good estimate as to 
the type of attack/tampering to the video sequence. 



502 



TEST FOR WATERMARK PRESENCE: 

" m 



STATISTICAL MEAN/VARIANCE TEST| 




FIG. 

500 



FALSE 



TEST FOR WATERMARK PRESENCE: L 
PEAK COMPARISON TEST [ 



512 



RECOVERY OF WATERMARKED i 
COEFFICIENTS [ 

^-508 




WATERMARK PROBABLY NOT 
PRESENT OR WATERMARK ATTACKED *5W 
AW DAMAGES 



BIT CLASSIFICATION 
AND RECOVERY 




USER MUST OECIDE WHETHER 
TO CONTINUE 



TRUE 



___ r^22 
TEST FOR HIGH-PASS i 
FILTERING OR SHARPENING J 



r 520 



ATTACK CLASSIFICATION TEST: 
« HIGH-PASS 

• LOW-PASS 

• RESAHPLE 
« NOW-LINEAR 



FAIL 



1 



PASS 



526 



ATTACK PROBABLY NOT CARRIED 
OUT OVER WHOLE SEQUENCE 



r 524 



SEQUENCE HAS PR03ABLY BEEN 
HIGH-PASS FILTERED 
OR SHARPENED 




TRUE 



534 



LOCALIZATION OF ATTACK: 
-POSITION IN FRAME(S) 
-POSITION IN TIME 
(FRAME REMOVAL) 



This print takes account of replacement documents submitted after the date of filing to enable the application to comply 
with the formal requirements of the Patents Rules 1995 



At least one drawing originally filed was informal and the print reproduced here is taken from a later filed formal 



G) 

CD 

ro 

CO 

**>i 
cn 

CJI 



copy. 



V8 



FIG. 1 



B 2 



ENHANCEMENT 
LAYER 



EI 



Ef 



FIG. 3 



BASE LAYER 



ENHANCEMENT 
LAYER 



EI 



EP 



FIG. 4 



t 



BASE LAYER 



r 



502 



3/8 



TEST FOR WATERMARK PRESENCE: 
STATISTICAL MEAN/VARIANCE TEST 



■504 



FIG. 5 

500 



■506 



X>50% 



510 



KEY 



TRUE 



FALSE 








V 






TEST FOR WATERMARK 


PRESENCE: 




PEAK COMPARISON 


TEST 



■512 



TRUE 



:COVERY OF WATERMARKED 
COEFF ICIENTS 

~" ^508 



530 




WATERMARK PROBABLY NOT 
PRESENT OR WATERMARK ATTACKED 
AND DAMAGED 



-516 



BIT CLASSIFICATION 
AND RECOVERY 



] 



USER MUST DECIDE WHETHER 
TO CONTINUE 




TRUE 



FALSE 



522 



1 



520 



ATTACK CLASSIFICATION TEST: 

• HIGH-PASS 

• LOW-PASS 

• RESAMPLE 

• NON-LINEAR 



TEST FOR HIGH-PASS 
FILTERING OR SHARPENING 

" IPASS™*™ 



FAIL 



526 



ATTACK PROBABLY NOT CARRIED 
OUT OVER WHOLE SEQUENCE 



1 



r 



524 



SEQUENCE HAS PROBABLY BEEN 
HIGH-PASS FILTERED 
OR SHARPENED 



532 



WORSE THAN 
JXPECTED RECOVERY. 

9 





FALSE 




r 536 


PROBABLY NO TAMPERING/ATTACK | 



1 



534 



LOCALIZATION OF ATTACK: 
-POSITION IN FRAME(S) 
-POSITION IN TIME 
(FRAME REMOVAL) 



4/8 



COMBINED BASE AND ENHANCEMENT 
OR ENHANCEMENT ONLY 
DCT COEFFICIENTS 



3 



SPLIT CEOFFICIENTS 
INTO GROUPS OF 990 



I 



SELECT COEFFICIENTS 



610 



SORT COEFFICIENTS 



612 



2l 



CALCU LATE MEANS 
v 



CALCULA TE VARIANCES 
v 



CALCULATE RATIOS 



616- 



618- 



TEST RA TIOS 



PASS 



COUNTER++ 




C0UNTER=100 * COUNTER / 
(4*(No. FRAMES)) 

I 

OUT 



\-614 



FAIL 



-622 



600 



-602 



} 



604 



~^608 



FIG. 6 



IN 

702-^ 



5, 



SPLIT COEFFICIENTS 
INTO GROUPS OF 990 

— 5 



-704 



708-^ 



SELECT COE FFICIENTS 

+ 



706 



CALCULATE MAXIMUM 
COEFFICIENT VALUE 



1 



CALCULATE MINIMUM 
COEFFICIENT VALUE 



710 



T 



7/4 



GROUP INTO FREQUENCY 
DISTRIBUTION WITH BINS FROM 
THE MINIMUM TO MAXIMUM 



CALCULATE POSITIONS 
OF LOCAL MAXIMA 



716^ 



1 



CALCULATE POSITIONS 
OF LOCAL MINIMA 



jr 1 



-7/2 



CALCULATE AREAS OF 
PEAKS BETWEEN MINIMA 



V7/0 



DISCARD SMALL PEAKS 



720 



CALCULATE AREAS OF 
EQUIVALENT-VE PEAKS 



J- 722 




FAIL 



COUNTER++ 



726 




700 



FIG. 7 



COUNTER 2 =100 * C0UNTER 2 / -730 
(4*(No. FRAMES)) 



6/8 



810 



FREQUENCY 



900 -i 
800 - 
700 - 
600- 
500 - 
400 - 
300 - 
200 - 
100 - 
0- 



830 



-50-40-30-20 -10 0 



-1 T^— I 1 1 

10 20 30 40 50 



COEFFICIENT VALUE 

^-820 



800 



FIG. 8A 



860 



FREQUENCY 



900 -i 
800 - 
700 - 
600 - 
500 - 
400 - 
300 - 
200 - 
100- 



880 



-50-40-30-20 -10 0 



10 20 30 40 50 



COEFFICIENT VALUE 

^870 



850 



FIG. 8B 



7/8 



50—i 
40- 



910 



} 30H 
FREQUENCY 

20- 
10 H 
0- 



-20 



r 



930 



NO FILTER 

HIGH PASS 



-940 



0 20 
COEFFICIENT VALUE 



40 



920 



~i 
60 



900 



wo 



FREQUENCY 



50-i 

45- 

40- 

35- 

30- 

25- 

20- 

15- 

10- 

5- 

0 — 
-20 



1030 



NO FILTER 

LOW PASS 



t\ 
n 
\ t\ 



1050 




1040 



40 



COEFFICIENT VALUE 

MO20 



1 
60 

1000 



FIG. 10 



8/8 



1110 



FREQUENCY 



50—i 
40- 
30- 
20- 
10 H 

o- 



1130 



NO FILTER 

RESAMPLE 




1140 



-20 



20 

COEFFICIENT VALUE 

^1120 



40 



— i 
60 



1100 



FIG. 11 



50 -i 
40 



1210 



30 

FREQUENCY 

20- 

10- 

0 



-20 




1230 



1250 



NO FILTER 
NON LINEAR 



1240 



i 1— 

20 40 

COEFFICIENT VALUE 

^1220 



7o 



1200 



FIG. 12 



2377575 

1 

WATERMARKING IN VIDEO TRANSMISSIONS 
Field of the Invention 

5 

This invention relates to video transmission systems and 
video encoding/decoding techniques and, in particular, to 
watermarking of video transmissions to enable tampering 
of the video transmission to be detected. The invention 
10 is applicable to, but not limited to, a video compression 
system where the video has been compressed using a 
scalable compression method. 

15 Background of the Invention 

Video is transmitted as a series of still 

images/pictures. Since the quality of a video signal can 
be affected during coding or compression of the video 

20 signal, it is known to include additional "layers" of 

transmission, based on the difference between the video 
signal and the encoded video bit stream, to enhance the 
quality of the received signal following decoding and/or 
decompression. Hence, a hierarchy of pictures and 

25 enhancement pictures partitioned into one or more layers 
is used, thereby resulting in a layered video bit stream 
being produced. 

A scalable video bit stream refers to the ability to 
30 transmit and receive video signals of more than one 

resolution and/or quality simultaneously. A scalable 
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video bit stream is one that may be decoded at different 
rates , according to the bandwidth available at the 
decoder. This enables the user with access to a higher 
bandwidth channel to decode high quality video, whilst a 
5 lower bandwidth user is still able to view the same 
video, albeit at a lower quality. 

The main reason that scalable video transmissions are 
used is for systems where multiple decoders with access 
10 to differing bandwidths are receiving images from a 
single encoder. 

In a layered scalable video bit stream, enhancements to 
the video signal may be added to the base layer either 
15 by: 

(i) increasing the resolution of the picture 
(spatial scalability) ; 

(ii) including error information to improve the 
Signal to Noise Ratio of the picture (SNR scalability) or 

20 (iii) including extra pictures to increase the frame 

rate (temporal scalability) . 

The term "hybrid scalability" implies using more than one 
of these types of scalable enhancements in the encoding 
25 of the video stream. 

Such enhancements may be applied to the whole picture, or 
to an arbitrarily shaped object within the picture, which 
is termed object-based scalability. In order to preserve 
30 the disposable nature of the temporal enhancement layer, 
the H.263+ standard dictates that pictures included in 
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the temporal scalability mode must be bi-directionally 
predicted (B) pictures. 

If a three layer video bit stream is used, the base layer 
5 (layer 1) will include intra-coded pictures (I pictures) 
sampled, coded or compressed from the original video 
signal pictures, and a plurality of predicted inter-coded 
pictures (P pictures) . In the enhancement layers (layers 
2 or 3 or more) , three types of picture may be used for 
10 scalability: bi-directionally predicted (B) pictures; 

enhanced intra (EI) pictures; and enhanced predicted (EP) 
pictures . 

The three basic known methods of scalability will now be 
15 explained. 

(i) Temporal Scalability 

Temporal scalability is achieved using bi-directionally 
20 predicted pictures, or B-pictures. B-pictures are 

predicted from previous and subsequent reconstructed 
pictures in the reference layer. Such reconstructed 
pictures generally result in improved compression 
efficiency as compared to that of P pictures. 

25 

B pictures are not used as reference pictures for the 
prediction of any other pictures. This allows for B- 
pictures to be discarded if necessary without adversely 
affecting any subsequent pictures, thus providing 
30 temporal scalability. FIG. 1 illustrates the predictive 
structure of P and B pictures. 



(ii) SNR Scalability 



A second method to achieve scalability is through 
spatial/SNR enhancement. Spatial scalability and SNR 
scalability are equivalent, except for the use of 
interpolation as is described shortly. SNR scalability 
refers to the process of improving the picture signal-to- 
noise-ratio (PSNR) of a base layer picture, by including 
additional error information in one or more enhancement 
layers . 

Because compression introduces artefacts and distortions 
into the video stream, the difference between a 
reconstructed picture and its original in the encoder is 
(nearly always) a non-zero-valued picture, containing 
what is generally termed the coding error. Normally, 
this coding error is lost at the encoder and never 
recovered . 

With SNR scalability, such coding error pictures can also 
be encoded and sent to the decoder, as shown in FIG. 2, 
producing an enhancement to the decoded picture. This 
additional error information may be used to either: 

(i) increase the spatial quality of the still or 
video image, or 

(ii) increase the spatial resolution of the picture 
(known as spatial scalability) , 
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and in both cases may be encoded in Enhancement Intra 
(EI) or Enhancement Predicted (EP) pictures - hence the 
term SNR scalability. 

5 FIG. 3 illustrates the data flow for SNR scalability. 

The vertical arrows from the lower layer illustrate that 
the picture in the enhancement layer is predicted from a 
reconstructed approximation of that picture in the 
reference (lower) layer. 

10 

FIG . 2 shows a schematic representation of an apparatus 
for conducting SNR scalability. In FIG. 2, a video 
picture F 0 is compressed and discrete cosine transformed, 
as is known in the art, at 1 to produce the base layer 
15 bit stream signal to be transmitted at a rate ri kbits per 
second (kbps) . This signal is decompressed at 2 to 
produce the reconstructed base layer picture F 0 1 . 

The compressed base layer bit stream is also decompressed 
20 at 3 and compared with the original picture F 0 at 4 to 

produce a difference signal 5. This difference signal is 
compressed at 6 and transmitted as the enhancement layer 
bit stream at a rate r 2 kbps. This enhancement layer bit 
stream is decompressed at 7 to produce the enhancement 
25 layer picture F 0 ' ' which is added to the reconstructed 
base layer picture F 0 1 at 8 to produce the final 
reconstructed picture F 0 '". 

If prediction is only formed from the lower layer, then 
30 the enhancement layer picture is referred to as an EI 

picture. It is possible, however, to create a modified 
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bi-directionally predicted picture using both a prior 
enhancement layer picture and a temporally simultaneous 
lower layer reference picture. This type of picture is 
referred to as an EP picture or "Enhancement" P-picture. 
5 The prediction flow for EI and EP pictures is shown in 
FIG. 3. (Although not specifically shown in FIG. 3, an 
EI picture in an enhancement layer may have a P picture 
as its lower layer reference picture, and an EP picture 
may have an I picture as its lower-layer enhancement 
10 picture . ) 

For both EI and EP pictures, the prediction from the 
reference layer uses no motion vectors. However, as with 
normal P pictures, EP pictures use motion vectors when 
15 predicting from their temporally, prior-reference picture 
in the same layer. 

(iii) Spatial Scalability 

20 The third well-known scalability method is spatial 

scalability, which is closely related to SNR scalability. 
The only difference is that before the picture in the 
reference layer is used to predict the picture in the 
spatial enhancement layer, it is interpolated by a factor 

25 of two either horizontally or vertically (1-D spatial 
scalability) , or both horizontally and vertically (2-D 
spatial scalability) . Spatial scalability is shown in 
FIG. 4. 
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In the field of this invention, it is known that 
individual images/pictures, or a series of images say, in 
a transmitted video stream, may be tampered with. It is 
also known that a need exists to protect such images or 
5 video stream from such undesirable tampering. One known 
technique employed to protect still images or documents 
is by the use of "watermarks". 

Ruanaidh, Dowling and Boland "Perceptual Watermarks of 
10 Digital Images", Proceedings of the IEEE INTCONF Image 
Processing, describes a technique for watermarking 
digital I frames in an MPEG sequence. 

In image watermarking, a known binary pattern or 
15 signature is embedded into an image at the moment of 

image acquisition. Such watermarks are termed "robust", 
because they are designed to remain intact regardless of 
any post-processing of the image such as filtering, 
cropping, etc, 

20 

While such watermarks do provide a useful degree of 
protection to still images, they cannot, at present, be 
wholly relied upon to protect other forms, for example 
documents . 

25 

As a digital video stream is a series of digital images, 
the concept of watermarking still images has been 
considered for extension to video technology. R. 
Wolfgang, C. Podilchuk and E. Delp - "Perceptual 
30 Watermarks of Digital Images", Proceedings of the IEEE 
INTCONF Image Processing - describes a technique for 
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watermarking discrete cosine transform (DCT) block-based 
video, namely digital I frames in an MPEG sequence. 

This paper describes the concept of embedding a watermark 
into an image in the transform domain (e.g. using DCT or 
wavelet coefficients) . After the appropriate transform 
has been applied to the image (e.g. block based DCT), a 
coefficient X(u,v) is modified according to the following 
equation : 

Y(u,v) = X(u,v) + J(u,v)W(u,v) [1] 
if |X(u,v) |>J(u,v) [2] 
else Y(u,v) = X(u,v) [3] 

Where W(u,v) is the watermark sequence and J(u,v) is a 
parameter indicating the "just noticeable difference" for 
each transform coefficient. This method only allows 
detection of the presence of the watermark, and requires 
the original image for detection. 



20 



In the scenario of a scalable video system where the base 
layer is freely available, but the enhancement layer 
requires extra capacity, there lies a need to be able to 
protect the enhancement layer from unlawful copying and 
distribution. For example, there could be a case where 
both the base and enhancement layers have been obtained 
legitimately, and have been recombined, but then 
illegally distributed. 
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The inventors of the present invention believe that no 
work has previously been carried out in the area of 
watermarking scalable video other than by Motorola. 
Furthermore, to the inventors' knowledge, no work has 
5 been carried out on tamper evidence for scalable video 
other than by Motorola. 

A study has been carried out on tamper evidence and 
authentication of MPEG video by C Lin & S Chang - "Issues 
10 and Solutions for Authenticating MPEG Video", SPIE 

Security and Watermarking of Multimedia Contents, EI '99, 
San Jose, CA. 

This tamper evidence technique cannot be applied to SNR 
15 scalable video as it would inherently destroy any visual 
improvements that the SNR enhancement layer brings. 
Also, this technique does not assess the full range of 
attacks that could take place, but only cut and paste 
operations within the frames. 

20 

Consequently, the technique fails to resolve the problems 
in a scalable video system where the base layer is freely 
available, but the enhancement layer requires extra 
capacity. In such a system, there lies a need to be able 
25 to protect the enhancement layer from tampering. 

A further consideration to a video transmission system 
designer is that if a pirate is aware that the video 
sequence is watermarked, (s)he may try to attack the 
30 whole sequence as to render the watermark impossible to 
recover. If, later, the tampering is discovered, it 



10 

would be required to demonstrate whether the illegally 
distributed sequences have the watermark present. This 
would be accomplished by recovering the watermark or at 
least demonstrating that the watermark is still partially 
5 present within the sequence, but that recovery has been 
made impossible by any attacks carried out on the 
sequence . 

In the scenario where the scalable video is being used 
10 for say, a surveillance system, it may be required to 
prove that what is shown on the video sequence did 
actually occur, and that no tampering took place. If 
tampering did occur, it may be required to demonstrate 
the exact locations in the sequence that had been 
15 tampered with. 

Protection of digital media (including image and video) 
has also become a key standardisation topic within the 
multimedia industry over the last year. Police users 
20 have formally stated that they do not see a possibility 
of using digitally transmitted and processed images for 
evidential purposes without the existence of reliable 
tamper detection methods. 

25 The European Broadcasting Union has issued a second call 
for systems for watermarking multimedia for entertainment 
applications. In addition, MPEG has started a new 
working group called MPEG21, whose essential function 
relies on digital rights management, which includes 

30 authentication of multimedia data. 
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In summary, there exists a need in the field of video 
transmission, and in particular in video transmissions 
that require authentication of the video image, for an 
5 improved arrangement of detecting and quantifying any 
tampering of a video sequence or image, where the 
abovementioned disadvantages with prior art arrangements 
may be alleviated. Furthermore, if such tampering is 
found, there is a need to localise and classify the 
10 tampering attack. 

Statement of Invention 

In accordance with a first aspect of the present 
15 invention there is provided a method for testing a 

presence of a watermark in a scalable video sequence, as 
claimed in claim 1. 

In accordance with a second aspect of the present 
20 invention there is provided a method for classifying a 

type of watermark attack on a scalable video sequence, as 
claimed in claim 11. 

In accordance with a third aspect of the present 
25 invention there is provided a video transmission system, 
as claimed in claim 15. 

In accordance with a fourth aspect of the present 
invention there is provided a video communication unit 
30 adapted to operate in the aforementioned video 
transmission system, as claimed in claim 18. 
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In accordance with a fifth aspect of the present 
invention there is provided a video communication unit 
adapted to operate any of the method steps of claim 1 or 
5 claim 11, as claimed in claim 19. 

In accordance with a sixth aspect of the present 
invention there is provided a mobile radio device 
comprising the aforementioned video communication unit, 
10 as claimed in claim 20. 

Brief Description of the Drawings 

15 FIG. 1 is a schematic illustration of a known scalable 
video arrangement showing B picture prediction 
dependencies . 

FIG. 2 is a schematic representation of a known scalable 
20 video arrangement for undertaking SNR scalability. 

FIG. 3 is a schematic illustration of a known scalable 
video arrangement showing a base layer and an enhancement 
layer produced using SNR scalability. 

25 

FIG. 4 is a schematic illustration of a known scalable 
video arrangement showing a base layer and an enhancement 
layer produced using spatial scalability. 
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Exemplary embodiments of the present invention will now 
be described, with reference to the accompanying 
drawings, in which: 

5 FIG. 5 shows a flowchart of a method for detecting the 
tampering of a watermarked video signal, in accordance 
with a preferred embodiment of the present invention. 

FIG. 6 shows a flowchart of a method for detecting the 
10 tampering of a watermarked video signal using a 

statistical mean/variance test, in accordance with a 
preferred embodiment of the present invention. 

FIG. 7 shows a flowchart of a method for detecting the 
15 tampering of a watermarked video signal using a peak 
comparison test, in accordance with a preferred 
embodiment of the present invention. 

FIG. 8 shows two graphs of frequency distribution of 990 
20 Coefficients: graph 8a for un-watermarked coefficients 
and graph 8b watermarked coefficients, used in a method 
for detecting the tampering of a watermarked video 
signal, in accordance with a preferred embodiment of the 
present invention . 

25 

FIG. 9 shows a graph highlighting the effect of high-pass 
filtering on the frequency distribution of 990 
Coefficients, in accordance with a preferred embodiment 
of the present invention. 
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FIG. 10 shows a graph highlighting the effect of low-pass 
filtering on the frequency distribution of 990 
Coefficients, in accordance with a preferred embodiment 
of the present invention. 

5 

FIG. 11 shows a graph highlighting the effect of re- 
sampling/cropping and re-sampling type attacks on the 
frequency distribution of 990 Coef f icients , in accordance 
with a preferred embodiment of the present invention. 

10 

FIG. 12 shows a graph highlighting the effect of common 
non-linear filtering attacks on the frequency 
distribution of 990 Coefficients, in accordance with a 
preferred embodiment of the present invention. 

15 

Description of Preferred Embodiments 

This invention applies to both SNR and spatial scalable 
20 encoded video, although the description that follows 

refers mainly to SNR scalability for convenience. The 
inventive concepts herein described find particular 
application in the current MPEG technology arena 
developing a standard watermarking system for video use. 
25 The detection of tampering and the ability to determine 
what type of tampering has taken place are necessary 
steps in ensuring user confidence in the images and 
videos that they are viewing in an increasingly hostile 
multimedia communication environment . 

30 
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In summary, the preferred embodiment of this invention 
uses a series of new tests to make a decision as to 
whether the scalable video sequence has been tampered 
with. In addition, the tests are designed to provide a 
good estimate as to the type of attack/tampering 
performed and the location of the attack (whether it be 
the whole video sequence or a single frame) . 

Estimation of the type of attack is necessary so as not 
to confuse an attack with a legitimate enhancement of the 
sequence, e.g., by high-pass filtering to bring out small 
details in each image. 

The first two tests, namely the statistical mean/variance 
test and peak comparison test have the useful side effect 
that they provide a quick way of verifying the presence 
of the watermark without requiring a vx key" to extract the 
watermark . 

The tests provide further benefit in that they do not 
disclose any information about the location or contents 
of the watermark. Such a benefit is consistent with the 
concept of a 'Public 1 watermarking system whereby anyone 
can verify the presence of a watermark, but only the key- 
holder can extract and read the watermark. 

Both of these tests have the same mathematical basis, 
although the tests themselves are different. These two 
tests are based on the fact that in an un-watermar ked 
sequence, you would expect there to be as many rounding- 
down errors as rounding-up errors of the discrete cosine 
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transform (DCT) coefficients in the base layer, due to 
the quantiser design. This implies that in the 
enhancement layer, there are as many positive 
coefficients as negative (assuming coefficients equal to 
zero are neither positive or negative). 

However, using the watermarking system according to the 
preferred embodiment of the present invention, the 
relationship changes slightly, inasmuch as there are 
slightly more positive coefficients than negative. The 
two tests use this principle to decide on the presence of 
a watermark. 

Known video formats include common intermediate format 
(CIF) and quarter common intermediate format (QC1F) . A 

QCIF image has 99 macro-blocks, each comprising four 
(8*8) blocks. 

In the preferred embodiment of the present invention, in 
each frame of a watermarked QCIF video there are 396 
watermarked coefficients (one per 8*8 block) . Indeed, it 
is within the contemplation of the invention that any 
number up to 396 coefficients may be used. However, for 
a typical QCIF image, in accordance with the preferred 
embodiment of the invention, the watermark may have a 
length of 99 binary bits, with the 99 binary bits perhaps 
representing a 9*11 binary image, repeated 4 times. 

There are ten possible locations for these coefficients 
in each 8*8 block (actual location chosen by a user key 
and random number generator) . The inputs for both tests 



are the ten possibly watermarked coefficients from each 
8*8 DCT block within each frame in the sequence. 

Referring first to FIG. 5, a flowchart 500 of a method 
for detecting the tampering of a watermarked video signal 
is shown, in accordance with a preferred embodiment of 
the present invention. In particular, the block diagram 
demonstrates the ordering of a series of new tests, in 
order to make a decision as to whether the scalable video 
sequence has been tampered with. 

An input video signal 502, having been passed through a 
discrete cosine transform function (not shown) , is 
received and a determination is required as to whether 
the video signal has been tampered with. The video 
signal is applied to a statistical mean/variance test 504 
that tests for the presence of a watermark. This test is 
described in greater detail with regard to FIG. 6. 

The output from the statistical mean/variance test 504 
produces "X", which is the probability that the sequence 
has been watermarked. In each frame of the watermarked 
QCIF video data stream, a statistical mean/variance 
decision is made, as shown in step 506. This decision 
primarily checks to see if the result from the 
mean/variance test is greater than 50%. If the result is 
greater than 50%, the sequence is assumed to contain a 
watermark and the second test for a presence of a 
watermark may be skipped. If there are more positive 
coefficients (X > 50%) in the ten times 396 watermarked 
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coefficients in step 506, then the watermarked 
coefficients are recovered in step 508 using a key 510. 

If there are not more positive coefficients in the ten 
5 times 396 watermarked coefficients in step 506, then a 

second test for the presence of a watermark is preferably 
performed using the peak comparison test, as shown in 
step 512. Clearly, a skilled artisan would recognise 
that if there were more negative coefficients, an attack 
10 is likely to have been made. This test is described in 
greater detail with respect to FIG. 7. 

If the peak comparison decision yields a (Y>50%) result 
in step 514, then recovery of the watermarked 

15 coefficients is performed, as shown in step 508, using 
the key 510. The output from the peak comparison test 
512 produces "Y" which is the probability that the 
sequence has been watermarked. If the peak comparison 
test fails to yield a (Y>50%) result, then it is assumed 

20 that a watermark is probably not present, or the 

watermark has been attacked and is damaged, as in step 
516. 

This decision basically checks to see if the result from 
25 the peak comparison test is over 50%. If the result is 
over 50%, the image is assumed to contain a watermark. 
If the result is under 50%, the image is assumed to 
either not contain a watermark, or have had a major 
attack to the watermark, resulting in poor 
30 recover ability . 
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If the sequence fails this test, the user may decide 
whether to continue with the testing, as shown in step 
508, using, the key 510. In most cases the testing should 
continue, because, it is possible that evidence that the 
5 sequence originally contained a watermark can be gained 
from attempting to recover the watermark in step 508, and 
looking at the recovered watermark patterns. 

The operation of recovering the watermarked coefficients, 
10 in step 508, is identical to the recovery of the 

watermarked coefficients as defined by the watermarking 
method. Given the user key 510 that was used in the 
embedding of an original watermark in the encoder, a 
number is generated between one and ten for every 8*8 
15 block in the sequence. Although the preferred embodiment 
of the present invention is described with reference to 
using up to ten coefficients within each 8*8 block, a 
skilled artisan will recognise that the inventive 
concepts described herein could clearly apply to other 
20 configurations . 

This number corresponds to a location of the watermarked 
coefficient in the block. The watermarked coefficients 
can thus be read by selecting the correct coefficient 
25 from each block based on the numbers generated from the 
key 510. 

If, after performing the statistical mean/variance test 
504 and possibly the peak comparison test 512, the 
30 recovery of the watermarked coefficients is performed. A 
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whole sequence attack determination is made by 
determining whether (X or Y) <85%, as shown in step 518. 

The whole sequence attack determination, in step 518, is 
performed to decide whether the whole sequence has been 
filtered. The test is simple and basically checks to see 
whether the "X" or "Y" result from the peak comparison 
test of step 512, or the mean/variance test 504 (if the 
peak comparison test 512 was not performed where "X" is 
the only test result available) is over a certain 
threshold. Experiments have shown that a preferred 
threshold value of 85% works well, as shown in Table 1. 

If the result is over this threshold, then this implies 
that there was either not an attack, or there was a high- 
pass filter type attack, to the whole sequence. If this 
determination of (X or Y) < 85% is found to be true, then 
an attack classification test is performed, as shown in 
step 520, to classify the kind of attack. 

The classification process involves taking the 99 
recovered coefficients from each repeat of the watermark, 
and grouping them into a frequency distribution. By 
assessing the shape of the distribution, the type of 
attack can be classified into one of; high-pass, low- 
pass, re-sample/crop and re-sample, or non-linear. The 
effect on the frequency distribution of each of these 
attack classifications is described below with reference 
to FIG's 9 to 12. 
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High-pass : 

The high-pass filter attack can be modelled by taking 
each DCT coefficient and multiplying it by some variable, 
based on the respective frequency of the coefficient, and 
5 then adding it back to itself. FIG. 9 shows a graph 900 
highlighting the effect of high-pass filtering on the 
frequency distribution of 990 coefficients, in accordance 
with a preferred embodiment of the present invention. 
The graph 900 shows frequency distribution 910 plotted 
10 against coefficient value 920, with the primary 

distribution of values 930 being centred around "zero". 
As can be seen, the filter has the effect of increasing 
the range of the frequency distribution response 950, 
compared to the non-filtered values 940, as illustrated. 

15 

Low-pass : 

The low-pass filter has the opposite effect of the high- 
pass filter, and causes the data to bunch up, thereby 
reducing the frequency distribution range. FIG. 10 shows 

20 a graph 1000 highlighting the effect of low-pass 
filtering on the frequency distribution of 990 
coefficients, in accordance with a preferred embodiment 
of the present invention. The graph 1000 shows frequency 
distribution 1010 plotted against coefficient value 1020, 

25 with the primary distribution of values 1030 being 
centred around "zero". 

As can be seen, the filter has the effect of decreasing 
the range of the frequency distribution response 1050, 
30 compared to the non-filtered values 1040, as illustrated. 
If the filter is strong enough, the two peaks 1030 and 
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1050 may merge into a single peak with sidebands on the 
positive side, as illustrated in FIG. 10. It is 
important to note that the height of the peak at "zero" 
increases dramatically with the application of a low-pass 
5 type filter. 

Re-sample/Crop and Re-sample attacks: 

The re-sample/cropping and re-sampling-type attacks are 
performed by re-sampling each frame to a different size 

10 and then back to the original size. Alternatively, each 
frame is cropped and re-sampling back to the original 
size. FIG. 11 shows a graph 1100 highlighting the effect 
of re-sample/cropping and re-sampling-type attacks on the 
frequency distribution of 990 coefficients, in accordance 

15 with a preferred embodiment of the present invention. 

The graph 1100 shows frequency 1110 plotted against 
coefficient value 1120, with the primary distribution of 
values 1130 being centred around >v zero". As can be seen, 

20 the filtering associated with this kind of attack has the 
effect of jumbling up the coefficients in each block, as 
shown in response 1150, as compared to the non-filtered 
values 1140. The primary indication is that they appear 
not to contain a watermark. Hence, one can readily 

25 classify an attack as a re-sample/crop type attack when 
locations of the peaks in the frequency distribution of 
the coefficients are differently distributed to that 
expected . 
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Non- linear filtering : 

Referring now to FIG. 12, a graph 1200 highlighting the 
effect of non-linear filtering on the frequency 
distribution of 990 coefficients, in accordance with a 
5 preferred embodiment of the present invention, is shown. 

The graph 1200 shows frequency 1210 plotted against 
coefficient value 1220, with the primary distribution of 
values 1230 being centred around "zero". As can be seen, 
10 the effect of a non-linear filter is to create new 

frequencies within the image, which is seen as a combined 
grouping and spreading of the frequency distribution as 
shown in response 1250, as compared to the non-filtered 
values 1240. 

15 

This effect depends upon the filter used. An important 
characteristic to note is the reduction in height of the 
peak around "zero", created by the filtered coefficient 
values. Hence, in summary, one can readily classify an 
20 attack as a non-linear attack when peaks in the frequency 
distribution of the coefficients tend more towards a 
series of small peaks, with a primary peak located around 
"zero" being consequently reduced in height/area. 

25 Such a variety of filtering effects ensure that it is 
virtually impossible to actually recover the whole 
watermark as the positions of the watermarked 
coefficients will have moved, and the positions generated 
by the key no longer match with the watermarked 

30 coefficients. However, because the watermarked 
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coefficients are still present, the sequence still passes 
the peak comparison test for watermark presence. 

If the determination of whether (X or Y <85%) in step 518 
5 is found to be false, then a test for high-pass filtering 
or sharpening is performed, as shown in step 522. 

The test for high-pass filtering or sharpening, as shown 
in step 522, is identical to the test described above for 

10 high-pass filtering. The test is only performed if the 

likelihood is that the sequence either has been high-pass 
filtered or all of the frames of the video sequence have 
not been subjected to tampering. If the test indicates 
that the sequence has not been high-pass filtered, the 

15 attack is deemed to have been carried out over less than 
the whole video sequence, as highlighted in step 526. If 
high-pass filtering or sharpening is viewed as a 
legitimate form of enhancement, then this test need not 
be carried out. 

20 

If the video signal is found to fail the high-pass 
filtering or sharpening test, as shown in step 522, the 
sequence has probably been high-pass filtered or 
sharpened, as highlighted in step 524. 

25 

Concurrently with the determination of whether (X or Y 
<85%), a bit classification and recovery test is 
preferably performed, as shown in step 530. 

30 The bit-classification and recovery operation is 
performed in the same manner as described in the 



watermarking recovery method. Basically each coefficient 
is tested to decide on whether it contains a binary "1" 
or "0", based on whether the coefficient value lies 
between certain thresholds, namely whether the 
coefficient recovery is as expected or worse than 
expected, as shown in step 532. A means and method for 
generating an expected value is described in co-pending 
UK patent application (publication number 0031085.4). 

If the coefficient value lies outside the threshold ( s ) , 
then the bit is said to be in error, as its true value 
cannot be decided. The binary pattern from each of the 
four repeats in each frame is combined to give an error- 
corrected reconstruction of the watermark in that frame. 

In order to test the recovery rates of each frame, a 
method is required that looks at each bit and decides 
whether it is correct or not. If the original embedded 
watermark pattern is known, then the recovered watermark 
from each frame can be subtracted from the original 
watermark, and the number of bits in error can be summed. 

If the original watermark was not available, and each 
frame in the sequence was watermarked with the same 
watermark pattern, then a highly accurate representation 
of the original watermark could be created by combining 
all of the recovered watermarks within the sequence. 
This representation of the original watermark could then 
be used to calculate the bit error rate(s). 
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If each frame of the sequence were to be watermarked with 
a different watermark, then the preferred way to 
calculate an error rate would be to subtract the 
recovered watermark from each repeat of that watermark in 
5 the frame. Again, this could be used to calculate a bit 
error rate. It is worth noting that this is the most 
accurate method from the three listed, as a small tamper 
may not affect the recovered watermark, whilst it would 
affect one of the repeats. 

10 

The bit error rate calculated for each frame must then be 
compared to expected values, to decide whether the frame 
may have been tampered with. Based on experimentation, 
the inventors have found that if the watermark was cast 

15 with a strength of "nine", as known to those skilled in 
the art, then less than one error from the recovered 
watermark in seventy frames would be expected. 
Obviously, if the bit error rate (BER) were calculated 
from the bits in error from the individual repeats, then 

20 the expected BER would be higher. 

If the recovery is found to be as expected in step 532, 
it is deemed that the video sequence/video signal has 
probably not been subjected to an attack or any 
25 tampering, as shown in step 536. 

However, if the recovery is found to be worse than 
expected in step 532, it is deemed that the video 
sequence/video signal has been subjected to an attack or 
30 tampering. Preferably, a positioning determination in 

either the frame or in time is performed to determine the 
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location of the attack in the video sequence/video 
signal . 

If the video sequence started with a good recovery rate, 
5 and suddenly dropped to a poor /non-existent recovery 

rate, it may be assumed that the attacker has removed one 
or more whole frames (or, possibly inserted frames) , 
causing the actual locations of the watermarked 
coefficients to become out of synchronisation with the 
10 positions predicted by the user key. 

This doesn't actually affect any of the tests for the 
presence of a watermark, as the watermark is still in the 
sequence, just not in the expected location. In order to 
15 find out how many frames the attacker may have removed, 

'dummy' frames can be inserted, at the point the recovery 
dropped off, until the recovery rates return to normal - 
the number of frames removed by the attacker would be 
equal to the number of 'dummy' frames. 

20 

If the recovery for a certain frame, or frames, is lower 
than expected, then it may be assumed that the frame or 
frames have been tampered with. The location of any 
tampering can be approximated by mapping each watermark 
25 bit to its original location within the frame when it was 
embedded . 

Referring now to FIG. 6 a flowchart 600 of a statistical 
mean/variance test is shown in accordance with a 
30 preferred embodiment of the present invention. This test 
is particularly useful as it is reasonably invariant to 
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certain types of filter, such as a high-pass, or a 
sharpening filter . 

The Statistical mean/variance test in step 504 of FIG. 5 
is also fairly simple, in that it requires relatively few 
passes over the data to obtain the result. The test is 
based on the fact that if there is no watermark, the 
variance and absolute mean (abs (mean) ) of the negative 
coefficients should be equal to the variance and the 
abs (mean) of the positive coefficients. The method is as 
follows : 

The input video signal comprises either a combined 'base' 
and 'enhancement' layer of the scalable video sequence or 
an 'enhancement' -only layer. The signal 602 input to the 
testing function (s) described above is a discrete cosine 
transform function of this input video signal. 
Partitioning of the bit-stream into 'base' plus 
'enhancement' is only a viable technique where the 
transmission or communication system guarantees the 
reception of the base layer. 

The DCT of the video signal 602 is input into a function 
where the coefficients are split into groups of 990, as 
shown in step 604. The groups correspond to the ten 
possibly watermarked coefficients in each block per 99- 
bit repeat of the watermark. 

In the preferred embodiment of the invention, a selected 
watermark is only applied to the luminance of the 
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enhancement layer. Preferably, each 8x8 coefficient 
block conceals one watermark bit. 

As the discrete cosine transform (DCT) is additive: 

5 

DCT [A] + DCT [ B ] = DCT [A+B ] [4] 

any changes to coefficients in the enhancement layer may 
be determined from a combined 'Base + Enhancement' layer. 

10 

Each 8x8 DCT coefficient block, comprising 990 
coefficients, is then applied to the coefficient selector 
function, as shown in step 606. 

15 The selected coefficients 606 are then sorted into 

ascending or descending order, as in step 608. The mean 
610 and variance 612 of all the coefficients less than 
"zero" and greater than "zero'' are calculated as shown in 
equations [5] and [6] below. 

20 

Mean = m { =— YV [5] 
n 1 

Variance = crj = — [ 6 ] 

n 

The mean of the negative <<"0") side is then divided by 
25 the mean of the positive (>"0") side to obtain the ratio 
of means, as shown in step 614 and equation [7] below. 

Ratio of means - m y— [ 7 ] 

J /^2 



10 



15 



20 



The variance of the negative (<"0") side is divided by 

the variance of the positive (>"0") side to obtain the 

ratio of variance, as shown in step 614 and equation [8] 
below . 



If the: 

'"ratio of means" (in step 614) >= y A' ; 

where (A--0.6); and 
"ratio of variance" <= 'B' 

where (B-0.6); when testing the ratios in 
step 616, then the sequence is assumed to be watermarked. 
A counter is then incremented, as show in step 618 and 
equation [9] below. If the test fails, and it is 
therefore determined that a watermark is not present, the 
^counter' step is by-passed, as shown. 



The method then repeats, if it has not completed the 
process for all 990 coefficients, as shown in step 620, 
until the whole sequence has been tested. The counter is 
then preferably divided by four, and then divided by the 
number of frames and multiplied by 100, as shown in step 
622 and equation [10] below. 




[8] 




[9] 



31 



COUNTER = 100 * COUNTER / (4 * (2Vb. Frames)) [10] 

This provides a result of the percentage probability that 
the sequence is watermarked. 

5 

Referring now to FIG. 7, a flowchart 700 of a method for 
detecting the tampering of a watermarked video signal 
using a peak comparison test, as in step 512 of FIG. 5, 
is shown, in accordance with a preferred embodiment of 
10 the present invention. 

This test is slightly more computationally intensive than 
the statistical test, but provides better results over a 
wider range of attacks to the sequence. 

15 

The basis for the method is that if the 990 coefficients, 
from each repeat of the watermark, are grouped into a 
frequency distribution, with equal sized bins of one, 
they will have a unique shape of distribution based on 
20 whether they carry a watermark or not. 

Computationally, the peak comparison test method is 
performed by the input coefficients being split into 
groups of 990 corresponding to the ten possibly 
25 watermarked coefficients in each block, per 99-bit repeat 
of the watermark, as shown in step 702. 
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The first 990 coefficients are selected, as shown in step 
706. The minimum 710 and maximum 708 coefficient values 
are calculated. The frequency distribution is 
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calculated, as in step 712, for all values between the 
minimum and maximum. 

The locations of all the local maxima 714 are calculated 
5 for coefficient values between "zero" and the maximum. 

The locations of the two local minima 716 associated with 
each maxima is calculated. The area of each peak is 
calculated by summing the frequencies between the two 
local minima associated with each maxima, as shown in 
10 step 718. 

Small peaks with areas < C (for example where 02) are 
discarded, noting that calculation errors and certain 
types of attack can sometimes cause extra noise. 

15 

The areas of the equivalent negative peaks are 
calculated, as in step 722, by summing the frequencies 
between * the positions of the two local minima 

associated with each positive peak. The area of each 
20 positive peak is compared with its equivalent negative 
peak, as shown in step 724. If the positive peak has a 
much greater area, then this indicates the presence of a 
watermark. A counter is incremented, as shown in step 
726 and equation [11] below. 

25 

IF peak-h > A * peak- THEN COUNTER++ [11] 

The counter is divided by the number of positive peaks, 
in order to normalise it. The value of the first counter 
30 is then added to a second counter, which sums the results 
over the whole sequence. If more frames of the video 
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sequence/signal are received,, the process is not finished 
in step 728 and the process then repeats for subsequent 
watermark repeats and frames, as in step 706. 

5 Alternatively, if the process is finished in step 728, 
the second counter is finally divided by four, then 
divided by the number of frames and multiplied by 100 to 
give a percentage probability of watermark presence, as 
shown in step 730 and equation [12] below. 

0 

COUNTERS = 100 * C0UNTER 2 / ( '4* (No . Frames ) ) [12] 



Referring now to FIG. 8, graph 8a shows the frequencies 
15 of 990 coefficients without a watermark 800, and Graph 8b 
shows the coefficients with a watermark 850. Graphs 8a 
and 8b are shown with frequency 810, 860, plotted against 
co-efficient value 820, 870. 

20 In an un-watermar ked sequence, it is expected that there 
will be approximately equal amounts of rounding-down to 
those of rounding-up. Consequently, it is expected that 
the peaks on the negative side of the graph will have the 
same areas as the corresponding peak on the positive side 

25 (the quantiser ensures that the locations of the peaks on 
either side of "zero" are equal) . 

However, if the sequence has been watermarked, it is 
expected that at least one of the peaks on the positive 
30 side will be much larger than its equivalent peak on the 
negative side. 
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Table 1 shows a series of results from both of the tests 
for determining the presence of a watermark. The results 
show how the two tests react to various kinds of 
5 tampering applied to all frames within the sequence. 

Table 1: Watermark Presence Results 





WITH WATERMARK 


WITHOUT WATERMARK 


Tamper 


Statistical 


Peak 


Statistical 


Peak 




mean / 


comparison 


mean / 


comparison 




variance 


Test 


variance 


Test 




Test 




Test 




NONE 


96% 


98% 


0% 


35% 


Crimmins ' 


0% 


77% 


0% 


35% 


Noise 










Removal 










3*3 Median 


3% 


73% 


0% 


35% 


Sharpen 


98% 


98% 


0% 


35% 


(High-Pass) 










Low-Pass 


45% 


75% 


0% 


5% 


Resample 


0% 


54% 


0% 


7% 


(bi-linear ) 











The inventors have found through extensive testing that, 
when the watermark is reconstructed from the four 
repeats, the reconstruction is identical to the original 
watermark. This means that an even better approximation 

15 to the area affected by the tampering can be found by 
repeating the reconstructed watermark four times and 
mapping it back to the locations in the frame. The 
mapping of the four repeats is then subtracted to form an 
image showing the locations of the attack within the 

20 frame. 
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It is within the contemplation of the invention that the 
inventive concepts described herein can be applied to any 
video format such as CIF, 4CIF and subQCIF, etc. and is 
not therefore limited to the QCIF format described above. 

5 

In order to improve the accuracy of the results from the 
'peak comparison 1 test or the statistical 'mean/variance 1 
test when a re-sampling style attack is suspected,, the 
number of coefficients selected from each DCT block could 

10 be increased to include those coefficients that would not 
normally have a watermark applied. This improves the 
accuracy of the result as the effect of a re-sampling 
type attack tends to be that the DCT coefficients are 
moved within the frame. A watermarked coefficient would 

15 not necessarily be moved to a location that would 

normally be in the ten possible watermark locations. 
Thus, expanding the search area should improve the 
results, whilst not altering the validity of the method. 

20 As mentioned previously, either of the two tests for 
watermark presence could be built into a stand-alone 
application. If this application were publicly 
available, then the method would become a so-called 
'public' watermarking system, as anyone could use the 

25 application to test their media for presence of a 

watermark. However, the application would not give away 
any information on the watermark' s contents or location 
of the watermark. The privately known user key would be 
required to locate and recover the actual watermark. 
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It is within the contemplation of the present invention 
that the aforementioned inventive concepts may be applied 
to any video communication unit and/or video transmission 
system. In particular, the inventive concepts find 
5 particular use in wireless (radio) devices such as mobile 
telephones / mobile radio units and associated wireless 
communication systems. Such wireless communication units 
may include a portable or mobile PMR radio, a personal 
digital assistant, a lap-top computer or a wirelessly 
10 networked PC, 

Scalable video system technology may be implemented in 
the 3 rd generation (3G) of digital cellular telephones, 
commonly referred to as the Universal Mobile 

15 Telecommunications Standard (UMTS) . Scalable video 
system technology may also find applicability in the 
packet data variants of both the current 2 nd generation of 
cellular telephones, commonly referred to as the general 
packet-data radio system (GPRS) and the TErrestrial 

20 Trunked RAdio (TETRA) standard for digital private and 
public mobile radio systems. 

Furthermore, scalable video system technology may also be 
utilised in the Internet. MPEG-4 may also adopt 
25 watermarking in that particular standard. The 

aforementioned inventive concepts will therefore find 
applicability in, and thereby benefit, all these emerging 
technologies . 



30 



37 



It will be understood that the video transmission and 
watermarking arrangement described above provides the 
following advantages: 

5 (i) allows a determination to be made as to 

whether a scalable video sequence has been tampered 
with; 

(ii) provides a good estimate as to the type of 
10 attack/tampering performed; 

(iii) provides a good estimate of the location of 
the attack (whether it be the whole sequence or a single 
frame) ; 

15 

(iv) provides an efficient mechanism for 
verifying the presence of a watermark without requiring a 
key to extract the watermark, or giving any information 
away about the location or contents of the watermark; and 

20 

(v) supports the 'Public* watermarking system 
concept whereby anyone can verify the presence of a 
watermark, but only the key-holder is able to extract and 
read the watermark. 

25 

In summary, a method for testing a presence of a 
watermark in a scalable video sequence for detecting 
tampering of the scalable video sequence has been 
30 provided. The method includes the steps of receiving a 
scalable video sequence having a plurality of blocks, 



38 



wherein a number of said plurality of blocks include at 
least one watermark coefficient. A discrete cosine 
transform is performed on the received scalable video 
sequence, thereby recovering a plurality of coefficients 
associated with one or more block (s) of the video 
sequence. A number of positive coefficients are compared 
with a number of negative coefficients from the recovered 
plurality of coefficients to determine whether the 
received scalable video sequence has been tampered with. 



Furthermore, a method for classifying a type of watermark 
attack on a scalable video sequence containing a 
watermark has been described. In particular, such a 
method has been described in conjunction with the 
aforementioned method of testing for a presence of a 
watermark. The method includes the steps of performing a 
discrete cosine transform on said received scalable video 
sequence, thereby recovering a plurality of DCT 
coefficients associated with one or more block (s) of the 
video sequence; and classifying a type of watermark 
attack based on a shape of the frequency distribution of 
positive coefficients into at least one of: high-pass, 
low-pass, re-sample/crop and re-sample, or non-linear. 

In addition, a video transmission system has been 
described including means for receiving a watermarked 
video sequence that is operably coupled to means for 
performing a discrete cosine transform function to 
recover a plurality of coefficients. The means for 
performing a discrete cosine transform function is 
operably coupled to means for comparing said recovered 
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coefficients of the watermarked video sequence output 
from the discrete cosine transform function to determine 
evidence of tampering of the watermarked video sequence. 

5 A video communication unit, and a mobile radio device 
incorporating such a unit, adapted to operate in the 
aforementioned video transmission system or perform the 
aforementioned method steps has also been described. 

10 Generally, the inventive concepts contained herein are 
equally applicable to any suitable video or image 
transmission system. Whilst specific, and preferred, 
implementations of the present invention are described 
above, it is clear that variations and modifications of 

15 such inventive concepts could be readily applied by one 
skilled in the art. 

Thus, an improved arrangement for detecting and 
quantifying any tampering of a video sequence or image, 

20 has been provided, whereby the aforementioned 

disadvantages with prior art arrangements have been 
substantially alleviated. Furthermore, if such tampering 
of a video sequence or image was found, a mechanism has 
been provided to localise and classify the tampering 

25 attack. 
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Claims 

1. A method for testing (500, 600, 700) a presence 

of a watermark in a scalable video sequence, for 
detecting tampering of the scalable video sequence, the 

method characterised by the steps of: 

receiving a scalable video sequence (502, 602) , 

having a plurality of blocks, wherein a number of said 

plurality of blocks include at least one watermark 

coefficient; 

performing a discrete cosine transform on said 
received scalable video sequence, thereby recovering a 
plurality of coefficients associated with one or more 
blocks of the video sequence; and 

comparing (506) a number of positive coefficients 
with a number of negative coefficients from said 
recovered plurality of coefficients to determine whether 
said received scalable video sequence (502, 602) has been 
tampered with. 

2. The method for testing a presence of a watermark 

in a scalable video sequence according to claim 1, the 
method further characterised by the discrete cosine 
transform being performed on an enhancement layer of said 
received scalable video sequence. 



3. The method for testing a presence of a watermark 
in a scalable video sequence according to claim 1, the 
method further characterised by the step of: 

performing at least one of a statistical 
mean/variance test (504, 600) and a peak comparison test 
(512, 700) to determine the presence of a watermark. 

4. The method for testing a presence of a watermark 
in a scalable video sequence according to claim 3, by 
performing the statistical mean/variance test (504, 600), 
the method further characterised by the steps of: 

determining first a variance (612) and an 
absolute mean (abs(mean) ) (610) of the negative 
coefficients; and 

determining second a variance (612) and an 
abs(mean) (610) of the positive coefficients; 

wherein if said first and said second determinations are 
equal (614, 616), deciding that the received scalable 
video sequence contained no watermark. 

5. The method for testing a presence of a watermark 
in a scalable video sequence according to claim 3, by 
performing the peak comparison test (512, 700), the 
method further characterised by the steps of: 

grouping (712) the recovered coefficients into a 
frequency distribution; and 

determining (724) that the scalable video 
sequence contained a watermark based on a shape of said 
frequency distribution. 
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6. The method for testing a presence of a watermark 
in a scalable video sequence according to claim 5, the 
method further characterised by the steps of: 

assuming the scalable video sequence contained a 
5 watermark if the determination of positive coefficients 
is greater than said determination of negative 
coefficients; or 

assuming either the scalable video sequence did 
not contain a watermark, or the watermark of the scalable 
10 video sequence had been attacked if the determination of 
negative coefficients is greater than or equal to said 
determination of positive coef f icient s . 

7. The method for testing a presence of a watermark 
15 in a scalable video sequence according to any of claims 4 

to 6, the method further characterised by the step of: 

comparing a result from the peak comparison test 
(512, 700) or the statistical mean/ variance test (504, 
600) to a threshold to determine whether the whole 
20 scalable video sequence has been subjected to a watermark 
attack. 

8. The method for testing a presence of a watermark 
in a scalable video sequence according to any of claims 4 

25 to 7, the method further characterised by the steps of: 
increasing the number of coefficients selected 
from one or more block (s) of the scalable video sequence 
in order to improve an accuracy in the watermark 
determination from the peak comparison test (512, 700) or 

30 the statistical mean/variance test (504, 600) when, for 
example, a re-sampling style attack is suspected. 
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9. The method for testing a presence of a watermark 
in a scalable video sequence according to any of claims 4 
to 8, the method further characterised by the steps of: 

5 performing a bit-by-bit analysis (530) on the 

received scalable video sequence; and 

deciding whether each received bit has been 
correctly received, and at least one of: 

subtracting, if the original watermark is 
10 known, the recovered watermark from the original 
watermark in each frame; 

summing the number of bits in error to 
calculate a bit error rate for the scalable video 
sequence ; 

15 combining a number of recovered watermarks 

within the sequence, if the original watermark was not 
available and each frame in the sequence was watermarked 
with the same watermark pattern, in order to calculate a 
bit error rate for the scalable video sequence; and 

20 subtracting a recovered watermark from each 

repeat of that watermark in a frame, if each frame of the 
sequence was watermarked with a different watermark, to 
calculate a bit error rate for the scalable video 
sequence . 

25 

10. The method for testing a presence of a watermark 
in a scalable video sequence according to any of claims 4 
to 9, the method further characterised by the step of: 

comparing a calculated bit error rate to an 
30 expected bit error rate for each received video frame to 
determine whether the video frame has been tampered with. 
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11. A method for classifying (520) a type of 
watermark attack on a scalable video sequence containing 
a watermark, in particular according to the method for 
testing for a presence of a watermark of claim 7, the 

5 method characterised by the steps of: 

performing a discrete cosine transform on said 
received scalable video sequence (502), thereby 
recovering a plurality of DCT coefficients (508) 
associated with one or more block (s) of the video 
10 sequence; and 

classifying a type of watermark attack (520) 
based on a shape of the frequency distribution of 
positive coefficients into at least one of: high-pass, 
low-pass, re-sample/crop and re-sample, and non-linear. 

15 

12. The method for classifying a type of watermark 
attack on a scalable video sequence according to claim 
11, the method further including the steps of: 

taking each DCT coefficient; 
20 generating a frequency distribution of the DCT 

coefficients; and 

classifying said attack as a high-pass filter 
type attack when the frequency distribution range is 
found to have increased; or 
25 classifying said attack as a low-pass filter type 

attack when the frequency distribution range is found to 
have decreased. 
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13. The method for classifying a type of watermark 

attack on a scalable video sequence according to claim 
11, the method further characterised by the step of: 

classifying said attack as a re-sample/crop type 
5 attack when locations of the peaks in the frequency 
distribution of the coefficients are distributed 
differently to that expected. 



14. The method for classifying a type of watermark 

10 attack on a scalable video sequence according to claim 

11, the method further characterised by the step of: 

classifying said attack as a non-linear attack 

when peaks in the frequency distribution of the 

coefficients tend towards a series of small peaks, with a 
15 primary peak around zero being reduced in height/area to 

that expected. 



15. A video transmission system comprising: 
means for receiving a watermarked video sequence; 
the means .for receiving a watermarked video sequence 
being operably coupled to means for performing a discrete 
cosine transform function, to recover a plurality of 
coefficients ; 

the means for performing a discrete cosine transform 
function being operably coupled to means for comparing 
said recovered coefficients of the watermarked video 
sequence (502, 602) output from the means for performing 
a discrete cosine transform function, to determine 
evidence of tampering of the watermarked video sequence. 

16. A video transmission system according to claim 
15, wherein the video sequence is a scalable video bit 
stream. 



17. A video transmission system according to claim 

16, wherein the scalable video bit stream includes at 
least one of: a combined base layer and an enhancement 
layer of video signals or an enhancement layer only of 
video signals, with at least one watermark coefficient 
being applied to the enhancement layer. 



18. A video communication unit adapted to operate in 

the video transmission system of any of preceding claims 
15 to 17. 



19. A video communication unit adapted to perform the 

method steps of any of preceding claims 1 to 14 . 
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20. A mobile radio device comprising a video 

communication unit in accordance with claim 18 or claim 
19. 

5 21. The mobile radio device of claim 20, wherein the 

mobile radio device is a mobile phone, a portable or 
mobile PMR radio, a personal digital assistant, a lap-top 
computer or a wirelessly networked PC. 

10 22. A method for testing a presence of a watermark in 

a scalable video sequence substantially as hereinbefore 
described with reference to, and/or as illustrated by, 
FIG. 5 or FIG. 6 or FIG. 7 of the accompanying drawings. 
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